A Simple Guide to Keyword Clustering with spaCy
dev.toยท9hยท
Discuss: DEV
๐ŸฐMedieval Parsing
Preserving the digital legacy of company archives: Last stop, Newhaven.
dpconline.orgยท21h
๐Ÿ’พData Preservation
WorldCat Editions and Holdings Release
annas-archive.orgยท1dยท
Discuss: Hacker News
๐Ÿ“šMARC Records
Lessons from using AI in Discovery
thoughtbot.comยท1d
๐Ÿ•ต๏ธMetadata Mining
<h2>Resurrected - Two Latin Texts</h2>
naomiceder.techยท4h
๐Ÿ”คFont Archaeology
Internet Archive Ends Legal Battle With Record Labels Over Historic Recordings
yro.slashdot.orgยท6h
๐ŸŽตAudio Archaeology
Show HN: Semlib โ€“ Semantic Data Processing
github.comยท15hยท
Discuss: Hacker News
๐ŸŒณIncremental Parsing
Decoupling Search and Learning in Neural Net Training
arxiv.orgยท1h
๐Ÿง Learned Indexing
Digital Forensics Jobs Round-Up, September 15 2025
forensicfocus.comยท14h
๐ŸšจIncident Response
Cracking Open the Worldโ€™s Largest Time Capsule
atlasobscura.comยท12h
๐Ÿ“ผCassette Archaeology
Call for Submissions: Public Services Quarterly
archivespublishing.comยท15h
๐Ÿ“šLibrary and Information Science
A Grateful Goodbye to FSU Special Collections & Archives
fsuspecialcollections.wordpress.comยท14h
๐ŸบFormat Archaeology
Unlock 'Magic' Optimization: Smarter Search When Blindfolded by Arvind Sundararajan
dev.toยท9hยท
Discuss: DEV
๐Ÿ”Search Indexing
Text-to-SQL Oriented to the Process Mining Domain: A PT-EN Dataset for Query Translation
arxiv.orgยท1d
๐Ÿ“‹Document Grammar
Show HN: I Built a Free Site for Students to Test Their Knowledge on Their Notes
pdftoquiz.comยท1dยท
Discuss: Hacker News
๐Ÿ“„PDF Archaeology
The Risks of Code Assistant LLMs: Harmful Content, Misuse and Deception
unit42.paloaltonetworks.comยท7h
โšกProof Automation
Automated Data Lineage Reconstruction via Multi-Modal Graph Analysis & HyperScore Validation
dev.toยท10hยท
Discuss: DEV
๐Ÿ”—Data Provenance
Semantic Dictionary Encoding
falvotech.comยท14hยท
Discuss: Hacker News
๐ŸŒ€Brotli Dictionary
So you have your data, but how does it relate to the physical world?
blog.mapped.comยท43mยท
Discuss: Hacker News
๐ŸŒŠStream Processing
The AI-Scraping Free-for-All Is Coming to an End
nymag.comยท1dยท
Discuss: Hacker News
๐Ÿ“ฐContent Curation